SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Davis , Roger J. 

Flavell, Richard A. 
Rakic , Pasko 
Whitmarsh, Alan 
Kuan, Chia-Yi 
Yang, Di 

(ii) TITLE OF INVENTION: JNK3 MODULATORS AND METHODS OF USE 

(iii) NUMBER OF SEQUENCES: 24 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110-2804 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 95 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/165,522 

(B) FILING DATE: 02-OCT-1998 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/060,995 

(B) FILING DATE: 03-OCT-1997 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Fasse, J* Peter 

(B) REGISTRATION NUMBER: 32,983 

(C) REFERENCE/DOCKET NUMBER: 10363/005001 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617/542-5070 

(B) TELEFAX: 617/542-8906 

(C) TELEX: 200154 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear. 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 68... 1459 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

TTATGCAAGA AACTGTTGAA TTAGACCCGT TTCCTATAGA TGAGAAACCA TACAAGCTGT 
GGTATTT ATG AGC CTC CAT TTC TTA TAC TAC TGC AGT GAA CCA ACA TTG 
Met Ser Leu His Phe Leu Tyr Tyr Cys Ser Glu Pro Thr Leu 
1 5 10 

GAT GTG AAA ATT GCC TTT TGT CAG GGA TTC GAT AAA CAA GTG GAT GTG 
Asp Val Lys He Ala Phe Cys Gin Gly Phe Asp Lys Gin Val Asp Val 
15 20 25 30 

TCA TAT ATT GCC AAA CAT TAC AAC ATG AGC AAA AGC AAA GTT GAC AAC 
Ser Tyr He Ala Lys His Tyr Asn Met Ser Lys Ser Lys Val Asp Asn 
35 40 45 

CAG TTC TAC AGT GTG GAA GTG GGA GAC TCA ACC TTC ACA GTT CTC AAG 
Gin Phe Tyr Ser Val Glu Val Gly Asp Ser Thr Phe Thr Val Leu Lys 
50 55 60 

CGC TAC CAG AAT CTA AAG CCT ATT GGC TCT GGG GCT CAG GGC ATA GTT 
Arg Tyr Gin Asn Leu Lys Pro He Gly Ser Gly Ala Gin Gly He Val 
65 70 75 

TGT GCC GCG TAT GAT GCT GTC CTT GAC AGA AAT GTG GCC ATT AAG AAG 
Cys Ala Ala Tyr Asp Ala Val Leu Asp Arg Asn Val Ala He Lys Lys 
80 85 90 

CTC AGC AGA CCC TTT CAG AAC CAA ACA CAT GCC AAG AGA GCG TAC CGG 
Leu Ser Arg Pro Phe Gin Asn Gin Thr His Ala Lys Arg Ala Tyr Arg 
95 100 105 HO 

GAG CTG GTC CTC ATG AAG TGT GTG AAC CAT AAA AAC ATT ATT AGT TTA 
Glu Leu Val Leu Met Lys Cys Val Asn His Lys Asn He He Ser Leu 
115 120 125 

TTA AAT GTC TTC ACA CCC CAG AAA ACG CTG GAG GAG TTC CAA GAT GTT 
Leu Asn Val Phe Thr Pro Gin Lys Thr Leu Glu Glu Phe Gin Asp Val 
130 135 140 

TAC TTA GTA ATG GAA CTG ATG GAT GCC AAC TTA TGT CAA GTG ATT CAG 
Tyr Leu Val Met Glu Leu Met Asp Ala Asn Leu Cys Gin Val He Gin 
145 150 155 

ATG GAA TTA GAC CAT GAG CGA ATG TCT TAC CTG CTG TAC CAA ATG TTG 
Met Glu Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gin Met Leu 
160 165 170 

TGT GGC ATT AAG CAC CTC CAT TCT GCT GGA ATT ATT CAC AGG GAT TTA 
Cvs Gly He Lys His Leu His Ser Ala Gly He He His Arg Asp Leu 
175 180 185 190 

AAA CCA AGT AAC ATT GTA GTC AAG TCT GAT TGC ACA TTG AAA ATC CTG 
Lys Pro Ser Asn He Val Val Lys Ser Asp Cys Thr Leu Lys He Leu 
195 200 205 

GAC TTT GGA CTG GCC AGG ACA GCA GGC ACA AGC TTC ATG ATG ACT CCA 
Asp Phe Gly Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr Pro 
210 215 220 

TAT GTG GTG ACA CGT TAT TAC AGA GCC CCT GAG GTC ATC CTG GGG ATG 
Tyr Val Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val He Leu Gly Met 
225 * 230 235 



60 
109 



157 



205 



253 



301 



349 



397 



445 



493 



541 



589 



637 



685 



733 



781 
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GGC TAC AAG GAG AAC GTG GAT ATA TGG TCT GTG GGA TGC ATT ATG GGA 829 
Gly Tyr Lys Glu Asn Val Asp lie Trp Ser Val Gly Cys lie Met Gly 
240 245 250 

GAA ATG GTT CGC CAC AAA ATC CTC TTT CCA GGA AGG GAC TAT ATT GAC 877 
Glu Met Val Arg His Lys He Leu Phe Pro Gly Arg Asp Tyr He Asp 
255 " 260 265 270 

CAG TGG AAT AAG GTA ATT GAA CAA CTA GGA ACA CCA TGT CCA GAA TTC 925 
Gin Trp Asn Lys Val He Glu Gin Leu Gly Thr Pro Cys Pro Glu Phe 
275 280 285 

ATG AAG AAA TTG CAA CCC ACA GTA AGA AAC TAT GTG GAG AAT CGG CCC 973 
Met Lys Lys Leu Gin Pro Thr Val Arg Asn Tyr Val Glu Asn Arg Pro 
290 295 300 

AAG TAT GCG GGA CTC ACC TTC CCC AAA CTC TTC CCA GAT TCC CTC TTC 1021 
Lys Tyr Ala Gly Leu Thr Phe Pro Lys Leu Phe Pro Asp Ser Leu Phe 
305 310 315 

CCA GCG GAC TCC GAG CAC AAT AAA CTC AAA GCC AGC CAA GCC AGG GAC 1069 
Pro Ala Asp Ser Glu His Asn Lys Leu Lys Ala Ser Gin Ala Arg Asp 
320 ~ 325 330 

TTG TTG TCA AAG ATG CTA GTG ATT GAC CCA GCA AAA AGA ATA TCA GTG 1117 
Leu Leu Ser Lys Met Leu Val He Asp Pro Ala Lys Arg He Ser Val 
335 ~ 340 345 350 

GAC GAC GCC TTA CAG CAT CCC TAC ATC AAC GTC TGG TAT GAC CCA GCC 1165 
Asp Asp Ala Leu Gin His Pro Tyr He Asn Val Trp Tyr Asp Pro Ala 
355 360 365 

GAA GTG GAG GCG CCT CCA CCT CAG ATA TAT GAC AAG CAG TTG GAT GAA 1213 
Glu Val Glu Ala Pro Pro Pro Gin He Tyr Asp Lys Gin Leu Asp Glu 
370 375 380 

AGA GAA CAC ACA ATT GAA GAA TGG AAA GAA CTT ATC TAC AAG GAA GTA 1261 
Arg Glu His Thr He Glu Glu Trp Lys Glu Leu He Tyr Lys Glu Val 
385 390 395 

ATG AAT TCA GAA GAA AAG ACT AAA AAT GGT GTA GTA AAA GGA CAG CCT 1309 
Met Asn Ser Glu Glu Lys Thr Lys Asn Gly Val Val Lys Gly Gin Pro 
400 405 410 

TCT CCT TCA GGT GCA GCA GTG AAC AGC AGT GAG AGT CTC CCT CCA TCC 1357 
Ser Pro Ser Gly Ala Ala Val Asn Ser Ser Glu Ser Leu Pro Pro Ser 
415 420 425 430 

TCG TCT GTC AAT GAC ATC TCC TCC ATG TCC ACC GAC CAG ACC CTG GCA 1405 
Ser Ser Val Asn Asp He Ser Ser Met Ser Thr Asp Gin Thr Leu Ala 
435 440 445 

TCT GAC ACT GAC AGC AGC CTG GAA GCC TCG GCA GGA CCC CTG GGT TGT 1453 
Ser Asp Thr Asp Ser Ser Leu Glu Ala Ser Ala Gly Pro Leu Gly Cys 
450 455 460 

TGC AGG TGACTAGCCG CCTGCCTGCG AAACCCAGCG TTCTTCAGGA GATGAT 1505 
Cys Arg 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ser Leu His Phe Leu Tyr Tyr Cys Ser Glu Pro Thr Leu Asp Val 

1 5 10 15 

Lys He Ala Phe Cys Gin Gly Phe Asp Lys Gin Val Asp Val Ser Tyr 

20 25 30 

He Ala Lys His Tyr Asn Met Ser Lys Ser Lys Val Asp Asn Gin Phe 

35 40 45 

Tvr Ser Val Glu Val Gly Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr 

50 55 60 

Gin Asn Leu Lys Pro He Gly Ser Gly Ala Gin Gly He Val Cys Ala 
65 70 75 80 

Ala Tyr Asp Ala Val Leu Asp Arg Asn Val Ala He Lys Lys Leu Ser 

85 90 95 

Arg Pro Phe Gin Asn Gin Thr His Ala Lys Arg Ala Tyr Arg Glu Leu 

100 105 11° 

Val Leu Met Lys Cys Val Asn His Lys Asn He He Ser Leu Leu Asn 

115 120 125 

Val Phe Thr Pro Gin Lys Thr Leu Glu Glu Phe Gin Asp Val Tyr Leu 

130 135 140 

Val Met Glu Leu Met Asp Ala Asn Leu Cys Gin Val He Gin Met Glu 
145 150 155 160 

Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gin Met Leu Cys Gly 

165 170 175 

He Lys His Leu His Ser Ala Gly He He His Arg Asp Leu Lys Pro 

180 185 190 

Ser Asn He Val Val Lys Ser Asp Cys Thr Leu Lys He Leu Asp Phe 

195 200 205 

Glv Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr Pro Tyr Val 

210 215 220 

Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val He Leu Gly Met Gly Tyr 
225 230 235 240 

Lys Glu Asn Val Asp He Trp Ser Val Gly Cys He Met Gly Glu Met 

245 250 255 

val Arg His Lys He Leu Phe Pro Gly Arg Asp Tyr He Asp Gin Trp 

260 265 270 

Asn Lys Val He Glu Gin Leu Gly Thr Pro Cys Pro Glu Phe Met Lys 

275 280 285 

Lvs Leu Gin Pro Thr Val Arg Asn Tyr Val Glu Asn Arg Pro Lys Tyr 

290 295 300 

Ala Gly Leu Thr Phe Pro Lys Leu Phe Pro Asp Ser Leu Phe Pro Ala 
305 310 315 320 

Asp Ser Glu His Asn Lys Leu Lys Ala Ser Gin Ala Arg Asp Leu Leu 

325 330 335 

Ser Lys Met Leu Val He Asp Pro Ala Lys Arg He Ser Val Asp Asp 

340 345 350 

Ala Leu Gin His Pro Tyr He Asn Val Trp Tyr Asp Pro Ala Glu Val 

355 360 365 

Glu Ala Pro Pro Pro Gin He Tyr Asp Lys Gin Leu Asp Glu Arg Glu 
370 375 380 
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His Thr 


He 


Glu 


Glu 


Trp Lys 


Glu 


385 








390 




Ser Glu 


Glu 


Lys 


Thr 


Lys Asn Gly 
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Ala 
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Val 


Asn Ser 
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420 








Val Asn 


Asp 
435 


He 


Ser 


Ser Met 


Ser 
440 


Thr Asp 


Ser 


Ser 


Leu 


Glu Ala 


Ser 


450 
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Leu 


He 


Tyr 


Lys Glu Val 


Met Asn 






395 


400 


Val 


Val 


Lys 


Gly Gin Pro 


Ser Pro 




410 




415 


Glu 


Ser 


Leu 


Pro Pro Ser 


Ser Ser 


425 






430 




Thr 


Asp 


Gin 


Thr Leu Ala 


Ser Asp 






445 




Ala 


Gly Pro 


Leu Gly Cys Cys Arg 



460 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAGAAATGGC GTGGCAGGGG ACCCAGCGAG CCCAGAGGGA TTTTGCCGCT GCTTCCTCTA 60 

CCCCTGTATT TCACGCAGCT CTCTAAATTG ACTCAGCTCC AGGCTAGTGT GAGAAACACC 120 

ScaSgc CCATCTCAGA TCTTCACTAT GGCAACTTAT GCAAGAAACT GTTGAATTAG 180 

ACCCOTTTCC tatagatgag aaaccataca AGCTGTGGTA TTTATGAGCC TCCATTTCTT 240 

AGTGAACCAA CATTGGATGT GAAAATTGCC TTTTGTCAGG OATTCGATAA 300 

ACAAGTGGAT GTGTCATATA TTGCCAAACA TTACAACATG AGCAAAAGCA AAGTTGACAA 360 

S^TCTAC AGTGTGGAAG TGGGAGACTC AACCTTCACA GTTCTCAAGC GCTACCAGAA 420 

TCTAAAGCCT ATTGGCTCTG GGGCTCAGGG CATAGTTTGT GCCGCGTATG ATGCTGTCCT 480 

tcIcmaaat GTGGCCATTA AGAAGCTCAG CAGACCCTTT CAGAACCAAA CACATGCCAA 540 

GAGAGCGTAC CGGGAGCTGG TCCTCATGAA GTGTGTGAAC CATAAAAACA TTATTAGTTT 600 

ATTAAATGTC TTCACACCCC AGAAAACGCT GGAGGAGTTC CAAGATGTTT ACTTAGTAAT 660 

GGAACTGATG GATGCCAACT TATGTCAAGT GATTCAGATG GAATTAGACC ATGAGCGAAT 720 

GTCTTACCTG CTGTACCAAA TGTTGTGTGG CATTAAGCAC CTCCATTCTG CTGGAATTAT 780 

?caSgSat TTAAAACCAA GTAACATTGT AGTCAAGTCT GATTGCACAT TGAAAATCCT 840 

g^aSSSa cSScSagga cagcaggcac aagcttcatg atgactccat atgtggtgac 900 

ACGTTATTAC AGAGCCCCTG AGGTCATCCT GGGGATGGGC TACAAGGAGA ACGTGGATAT 960 

SSgtc goatgcatta tgggagaaat GGTTCGCCAC aaaatcctct ttccaggaag 1020 

GGACTATATT GACCAGTGGA ATAAGGTAAT TGAACAACTA GGAACACCAT GTCCAGAATT 1080 

CATGAAGAAA TTGCAACCCA CAGTAAGAAA CTATGTGGAG AATCGGCCCA AGTATGCGGG 1140 

MTCACCTTC CCCAAACTCT TCCCAGATTC CCTCTTCCCA GCGGACTCCG AGCACAATAA 1200 

MTCAAAGCC AGCCAAGCCA GGGACTTGTT GTCAAAGATG CTAGTGATTG ACCCAGCAAA 1260 

aStIScA GTGGACGACG CCTTACAGCA TCCCTACATC AACGTCTGGT ATGACCCAGC 1320 

SSgtggag gcgcctccac ctcagatata tgacaagcag TTGGATGAAA GAGAACACAC 1380 

AATTGAAGAA TGGAAAGAAC TTATCTACAA GGAAGTAATG AATTCAGAAG AAAAGACTAA 1440 

A^ATGG^gS GTAAAAGGAC AGCCTTCTCC TTCAGGTGCA GCAGTGAACA GCAGTGAGAG 1500 

TCTCCCTCCA TCCTCGTCTG TCAATGACAT CTCCTCCATG TCCACCGACC AGACCCTGGC 1560 

aSSS GACAgSSc TGGAAGCCTC GGCAGGACCC CTGGGTTGTT GCAGGTGACT 1620 

AGCCGCCTGC CTGCGAAACC CAGCGTTCTT CAGGAGATGA TGTGATGGAA CACACACACA 1680 

ScSaCACA CACACACACA CAAATGCAGA CACACAACAT CAAGAAAACA GCAAGGGAGA 1740 

CARTCCAAGC CTAAAATTAA ATAAATCTTT CAGCCTGCTT CTTCCCCAGG GTTCTGTATT 1800 

GcISSgC 5cAAATGTAT ATTTAACTTC TAGTTGCTCT TGCTTTGGTC TTCTTCCAAT 1860 

GAtcStIS ACAGAAAGCA AATCAGACAC AATTAGAGAA GCCTTTTCCA TAAAGTGTAA 1920 

TTTTAATGGC TGCAAAACCG GCAACCTGTA ACTGCCCTTT TAAATGGCAT GACAAGGTGT 1980 

gSgTCGCCC CATCCAGCAT GTGTGTGTCT CTATCTTGCA TCTACCTGCT CCTTGGCCTA 2040 

g5SgSg?a TCTAGATACA GATCCGCATG TGTCTGTATT CATACAGCAC TACTTACTTA 2100 

Stgctac Stcagtgtc CTCAGGGCTC TACCAAGACA TAATGCACTG GGGTACCACA 2160 

?g1?S?5 Stotgatct ATTACTCTGA CATAAACCCA TCTGTAATAT ATTGCCAGTA 2220 

TTAGTTTGTT AATTGATTAA ACTGTATGTC TTATAAGAAA ACATGTAAAG 2280 
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GGGGAATATA TTGGGGGAGT GAGCTCTCTC AGACCCTTGA AGATGTAGCT TCCAAATTTG 
AATGGATTAA ATGGCACCTG TATACCA 



2340 
2367 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1773 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 92... 1357 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATCTCAGATC TTCACTATGG CAACTTATGC AAGAAACTGT TGAATTAGAC CCGTTTCCTA 60 
TAGATGAGAA ACCATACAAG CTGTGGTATT T ATG AGC CTC CAT TTC TTA TAC 112 

Met Ser Leu His Phe Leu Tyr 
1 5 

TAC TGC AGT GAA CCA ACA TTG GAT GTG AAA ATT GCC TTT TGT CAG GGA 160 
Tvr Cys Ser Glu Pro Thr Leu Asp Val Lys He Ala Phe Cys Gin Gly 
10 15 20 

TTC GAT AAA CAA GTG GAT GTG TCA TAT ATT GCC AAA CAT TAC AAC ATG 208 
Phe Asp Lys Gin Val Asp Val Ser Tyr He Ala Lys His Tyr Asn Met 
25 30 35 

AGC AAA AGC AAA GTT GAC AAC CAG TTC TAC AGT GTG GAA GTG GGA GAC 256 
Ser Lys Ser Lys Val Asp Asn Gin Phe Tyr Ser Val Glu Val Gly Asp 
* 45 50 55 



40 



TCA ACC TTC ACA GTT CTC AAG CGC TAC CAG AAT CTA AAG CCT ATT GGC 304 
Ser Thr Phe Thr Val Leu Lys Arg Tyr Gin Asn Leu Lys Pro lie Gly 
60 65 70 

TCT GGG GCT CAG GGC ATA GTT TGT GCC GOG TAT GAT GCT GTC CTT GAC 352 
Ser Gly Ala Gin Gly He Val Cys Ala Ala Tyr Asp Ala Val Leu Asp 

80 85 



75 



400 



AGA AAT GTG GCC ATT AAG AAG CTC AGC AGA CCC TTT CAG AAC CAA ACA 
Arg Asn Val Ala He Lys Lys Leu Ser Arg Pro Phe Gin Asn Gin Thr 
90 95 100 

CAT GCC AAG AGA GCG TAC CGG GAG CTG GTC CTC ATG AAG TGT GTG AAC 448 
His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys Val Asn 
105 HO 115 

CAT AAA AAC ATT ATT AGT TTA TTA AAT GTC TTC ACA CCC CAG AAA ACG 496 
His Lys Asn He He Ser Leu Leu Asn Val Phe Thr Pro Gin Lys Thr 
120 125 130 135 

CTC GAG GAG TTC CAA GAT GTT TAC TTA GTA ATG GAA CTC ATG GAT GCC 544 
Leu Glu Glu Phe Gin Asp Val Tyr Leu Val Met Glu Leu Met Asp Ala 
140 145 150 
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AAC TTA TGT CAA GTG ATT CAG ATG GAA TTA GAC CAT GAG CGA ATG TCT 592 
Asn Leu Cys Gin Val He Gin Met Glu Leu Asp His Glu Arg Met Ser 
155 160 165 

TAC CTG CTG TAC CAA ATG TTG TGT GGC ATT AAG CAC CTC CAT TCT GCT 640 
Tvr Leu Leu Tyr Gin Met Leu Cys Gly He Lys His Leu His Ser Ala 
170 175 180 

GGA ATT ATT CAC AGG GAT TTA AAA CCA AGT AAC ATT GTA GTC AAG TCT 688 
Gly He He His Arg Asp Leu Lys Pro Ser Asn He Val Val Lys Ser 
185 190 195 

GAT TGC ACA TTG AAA ATC CTG GAC TTT GGA CTG GCC AGG ACA GCA GGC 736 
Asp Cys Thr Leu Lys He Leu Asp Phe Gly Leu Ala Arg Thr Ala Gly 
200 205 210 215 

ACA AGC TTC ATG ATG ACT CCA TAT GTG GTG ACA CGT TAT TAC AGA GCC 784 
Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg Ala 
220 225 230 

CCT GAG GTC ATC CTG GGG ATG GGC TAC AAG GAG AAC GTG GAT ATA TGG 832 
Pro Glu Val He Leu Gly Met Gly Tyr Lys Glu Asn Val Asp He Trp 
235 240 245 

TCT GTG GGA TGC ATT ATG GGA GAA ATG GTT CGC CAC AAA ATC CTC TTT 880 
Ser Val Gly Cys He Met Gly Glu Met Val Arg His Lys He Leu Phe 
250 255 260 

CCA GGA AGG GAC TAT ATT GAC CAG TGG AAT AAG GTA ATT GAA CAA CTA 928 
Pro Gly Arg Asp Tyr He Asp Gin Trp Asn Lys Val He Glu Gin Leu 
265 " 270 275 

GGA ACA CCA TGT CCA GAA TTC ATG AAG AAA TTG CAA CCC ACA GTA AGA 976 
Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gin Pro Thr Val Arg 
280 285 290 295 

AAC TAT GTG GAG AAT CGG CCC AAG TAT GCG GGA CTC ACC TTC CCC AAA 1024 
Asn Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Leu Thr Phe Pro Lys 
300 305 310 

CTC TTC CCA GAT TCC CTC TTC CCA GCG GAC TCC GAG CAC AAT AAA CTC 1072 
Leu Phe Pro Asp Ser Leu Phe Pro Ala Asp Ser Glu His Asn Lys Leu 
315 320 325 

AAA GCC AGC CAA GCC AGG GAC TTG TTG TCA AAG ATG CTA GTG ATT GAC 1120 
Lvs Ala Ser Gin Ala Arg Asp Leu Leu Ser Lys Met Leu Val He Asp 
330 335 340 

CCA GCA AAA AGA ATA TCA GTG GAC GAC GCC TTA CAG CAT CCC TAC ATC 1168 
Pro Ala Lys Arg He Ser Val Asp Asp Ala Leu Gin His Pro Tyr He 
345 350 355 

AAC GTC TGG TAT GAC CCA GCC GAA GTG GAG GCG CCT CCA CCT CAG ATA 1216 
Asn Val Trp Tyr Asp Pro Ala Glu Val Glu Ala Pro Pro Pro Gin lie 
360 365 370 375 

TAT GAC AAG CAG TTG GAT GAA AGA GAA CAC ACA ATT GAA GAA TGG AAA 1264 
Tyr Asp Lys Gin Leu Asp Glu Arg Glu His Thr He Glu Glu Trp Lys 
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GAA CTT ATC TAC AAG GAA GTA ATG AAT TCA GAA GAA AAG ACT AAA AAT 1312 
G^ Leu lie Tyr Lys Glu Val Met Asn Ser Glu Glu Lys Thr Lys Asn 



395 



GGT GTA GTA AAA GGA GAG CCT TCT CCT TCA GCA CAG GTG CAG CAG 
Gly Val Val Lys Gly Gin Pro Ser Pro Ser Ala Gin Val Gin Gin 
410 415 

TGAACAGCAG TGAGAGTCTC CCTCCATCCT CGTCTGTCAA TGACATCTCC TCCATGTCCA 
JSJcCAGAC CCTGGCATCT GACACTGACA GCAGCCTGGA AGCCTCGGCA GGACCCCTGG 

g^SSg gSSg c gcctgcctgc gaaacccagc gttcttcagg agatgatgtg 
aJSIacaca cacacacgca gacacacaca cacacacaaa tgcagacaca caacatcaag 
SSSSS ccaagcctaa aattaaataa atctttcagc ctccttcttc 

CCCAGGGTTC TGTATTGCAG CTAAGCTCAA ATGTATATTT AACTTCTAGT TGCTCTTGCT 
SSSctot TCCAATGATG CTTACTACAG AAAGCAAATC agacacaatt agagaa 



1357 



1417 
1477 
1537 
1597 
1657 
1717 
1773 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ser Leu His Phe Leu Tyr Tyr Cys Ser Glu Pro Thr Leu Asp Val 

Lys lie Ala Phe Cys Gin Gly Phe Asp Lys Gin Val Asp Val Ser Tyr 

20 25 JU 

lie Ala Lys His Tyr Asn Met Ser Lys Ser Lys Val Asp Asn Gin Phe 

35 40 4 

Tvr Ser Val Glu Val Gly Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr 

50 55 
Gin Asn Leu Lys Pro He Gly Ser Gly Ala Gin Gly He Val Cys Ala 

fic 70 '5 

Ala Tyr Asp Ala Val Leu Asp Arg Asn Val Ala lie Lys Lys Leu Ser 
35 90 * 3 



35 40 45 

Tvr Ser Val Glu Val Gly Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr 
50 55 60 m _ _ 

Asn Leu Lys Pro He G 
70 

Tyr Asp Ala Val Leu A 
85 

Pro Phe Gin Asn Gin I 
100 

Leu Met Lys Cys Val 3 
115 

Phe Thr Pro Gin Lys 1 
130 3 
Met Glu Leu Met Asp I 

Leu Asp His Glu Arg Me£ Ser Tyr Leu Leu iyr Gin Met Leu Cys Gly 
165 I 70 



Arg Pro Phe Gin Asn Gin Thr His Ala Lys Arg Ala Tyr Arg Glu Leu 

100 105 110 

Val Leu Met Lys Cys Val Asn His Lys Asn lie lie Ser Leu Leu Asn 

120 I 25 
Thr Leu Glu Glu Phe Gin Asp Val 
Yin 135 140 

Val Met Glu Leu Met Asp Ala Asn Leu Cys Gin Val He Gin Met Glu 

150 155 



115 ~ 120 125 
Val Phe Thr Pro Gin Lys Thr Leu Glu Glu Phe Gin Asp Val Tyr Leu 
nc 140 



lie Lys His Leu His Ser Ala Gly He He His Arg Asp Leu Lys Pro 
180 185 190 



Gly Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr Pro Tyr Val 



180 10 j 

Ser Asn lie Val Val Lys Ser Asp Cys Thr Leu Lys lie Leu Asp Phe 

200 205 
_ Thr Ser Phe 
215 220 
Pro Glu Val 

Ser Val Gly 
245 " * 250 



Val tt£ Arg Tyr Tyr Arg Ala Pro Glu Val lie Leu Gly Met Gly Tyr 

230 2,35 



Lys Glu Asn Val Asp lie Trp Ser Val Gly Cys lie Met Gly Glu Met 
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Val 
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His 


Lys 


He 


Leu 


Phe 


Pro 


Gly 


7\ 

Arg 


ASp 


iyr lie Asp uin 


Trp 






260 










2ob 






970 
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Lys 


Val 


He 


Glu 


Gin 


Leu 


Gly 


Tnr 


Fro 


Cys 


xtjlO vlu ir tits neu 


T.vfl 

J-i jr O 
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AOS 




Lys 


Leu 


Gin 


Pro 


Thr 


Val 


Arg 


Asn 


Tyr 


val 


GlU 


Asn Arg rro i-»y b 


ryr 


290 










one 
29b 












Ala 


Ala 


Gly 


Leu 


Thr 


Phe 


Pro 


Lys 


Leu 


Pne 


Pro 


ASp 


del lieU rue riU 


305 








310 










J1D 




320 


Asp 


Ser 


Glu 


His 


Asn 


Lys 


Leu 


Lys 


Ala 


Ser 


Gin 


Ala Arg Asp Lieu 


Leu 








325 








330 




335 




Ser 


Lys 


Met 


Leu 


Val 


He 


Asp 


Pro 


» 1 mm 

Ala 


Lys 


Arg 


lie £>er vai Asp 


Tv on 
nop 






340 










345 








Vol 


Ala 


Leu 


Gin 


His 


Pro 


Tyr 


He 


Asn 


Val 


Trp 


Tyr 


ASp rfO AX a tflu 






355 








o c r\ 

360 










Glu 


Glu 


Ala 
370 


Pro 


Pro 


Pro 


Gin 


lie 
375 
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Trp 
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390 
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400 


Ser 


Glu 
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Lys 
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Lys 


Asn 
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Val 


Val 


Lys 


Gly Gin Pro Ser 


Pro 
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410 




415 




Ser 


Ala 


Gin 


Val 
420 


Gin 


Gin 

















(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GAGAAATGGC GTGGCAGGGG ACCCAGCGAG CCCAGAGGGA TTTTGCCGCT GCTTCCTCTA 60 

CCCCTGTATT TCACGCAGCT CTCTAAATTG ACTCAGCTCC AGGCTAGTGT GAGAAACACC 120 

AACAGCAGGC CCATCTCAGA TCTTCACTAT GGCAACTTAT GCAAGAAACT GTTGAATTAG 180 

ACCCGTTTCC TATAGATGAG AAACCATACA AGCTGTGGTA TTTATGAGCC TCCATTTCTT 240 

ATACTACTGC AGTGAACCAA CATTGGATGT GAAAATTGCC TTTTGTCAGG GATTCGATAA 300 

ACAAGTGGAT GTGTCATATA TTGCCAAACA TTACAACATG AGCAAAAGCA AAGTTGACAA 360 

CCAGTTCTAC AGTGTGGAAG TGGGAGACTC AACCTTCACA GTTCTCAAGC GCTACCAGAA 420 

TCTAAAGCCT ATTGGCTCTG GGGCTCAGGG CATAGTTTGT GCCGCGTATG ATGCTGTCCT 480 

TGACAGAAAT GTGGCCATTA AGAAGCTCAG CAGACCCTTT CAGAACCAAA CACATGCCAA 540 

GAGAGCGTAC CGGGAGCTGG TCCTCATGAA GTGTGTGAAC CATAAAAACA TTATTAGTTT 600 

ATTAAATGTC TTCACACCCC AGAAAACGCT GGAGGAGTTC CAAGATGTTT ACTTAGTAAT 660 

GGAACTGATG GATGCCAACT TATGTCAAGT GATTCAGATG GAATTAGACC ATGAGCGAAT 720 

GTCTTACCTG CTGTACCAAA TGTTGTGTGG CATTAAGCAC CTCCATTCTG CTGGAATTAT 780 

TCACAGGGAT TTAAAACCAA GTAACATTGT AGTCAAGTCT GATTGCACAT TGAAAATCCT 840 

GGACTTTGGA CTGGCCAGGA CAGCAGGCAC AAGCTTCATG ATGACTCCAT ATGTGGTGAC 900 

ACGTTATTAC AGAGCCCCTG AGGTCATCCT GGGGATGGGC TACAAGGAGA ACGTGGATAT 960 

ATGGTCTGTG GGATGCATTA TGGGAGAAAT GGTTCGCCAC AAAATCCTCT TTCCAGGAAG 1020 

GGACTATATT GACCAGTGGA ATAAGGTAAT TGAACAACTA GGAACACCAT GTCCAGAATT 1080 

CATGAAGAAA TTGCAACCCA CAGTAAGAAA CTATGTGGAG AATCGGCCCA AGTATGCGGG 1140 

ACTCACCTTC CCCAAACTCT TCCCAGATTC CCTCTTCCCA GCGGACTCCG AGCACAATAA 1200 

ACTCAAAGCC AGCCAAGCCA GGGACTTGTT GTCAAAGATG CTAGTGATTG ACCCAGCAAA 1260 

AAGAATATCA GTGGACGACG CCTTACAGCA TCCCTACATC AACGTCTGGT ATGACCCAGC 1320 

CGAAGTGGAG GCGCCTCCAC CTCAGATATA TGACAAGCAG TTGGATGAAA GAGAACACAC 1380 

AATTGAAGAA TGGAAAGAAC TTATCTACAA GGAAGTAATG AATTCAGAAG AAAAGACTAA 1440 

AAATGGTGTA GTAAAAGGAC AGCCTTCTCC TTCAGCACAG GTGCAGCAGT GAACAGCAGT 1500 

GAGAGTCTCC CTCCATCCTC GTCTGTCAAT GACATCTCCT CCATGTCCAC CGACCAGACC 1560 

CTGGCATCTG ACACTGACAG CAGCCTGGAA GCCTCGGCAG GACCCCTGGG TTGTTGCAGG 1620 
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TGACTAGCCG 
ACACAC6CAG 
GCAGA6AATC 
GTATTGCAGC 
CCAATGATGC 
TGTAATTTTA 
GGTGTGCAGT 
GCCTAGTCAG 
ACTTAGAGAT 
CCACATGGTC 
CAGTATATAA 
TAAAGGGGGA 
ATTTGAATGG 



CCTGCCTGCG 
ACACACACAC 
CAAGCCTAAA 
TAAGCTCAAA 
TTACTACAGA 
ATGGCTGCAA 
GGCCCCATCC 
ATGGATGTAG 
GCTACTCTCA 
CATTTCATGT 
GCTGTTTAGT 
ATATATTGGG 
ATTAAATGGC 



AAACCCAGCG 
ACACACAAAT 
ATTAAATAAA 
TGTATATTTA 
AAGCAAATCA 
AACCGGCAAC 
AGCATGTGTG 
ATACAGATCC 
GTGTCCTCAG 
GATCTATTAC 
TTGTTAATTG 
GGAGTGAGCT 
ACCTGTATAC 



TTCTTCAGGA 
GCAGACACAC 
TCTTTCAGCC 
ACTTCTAGTT 
GACACAATTA 
CTGTAACTGC 
TGTCTCTATC 
GCATGTGTCT 
GGCTCTACCA 
TCTGACATAA 
ATTAAACTGT 
CTCTCAGACC 
CA 



GATGATGTGA 
AACATCAAGA 
TGCTTCTTCC 
GCTCTTGCTT 
GAGAAGCCTT 
CCTTTTAAAT 
TTGCATCTAC 
GTATTCATAC 
AGACATAATG 
ACCCATCTGT 
ATGTCTTATA 
CTTGAAGATG 



TGGAACACAC 
AAACAGCAAG 
CCAGGGTTCT 
TGGTCTTCTT 
TTCCATAAAG 
GGCATGACAA 
CTGCTCCTTG 
AGCACTACTT 
CACTGGGGTA 
AATATATTGC 
AGAAAACATG 
TAGCTTCCAA 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2372 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 224... 1489 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



GAGAAATGGC 
CCCCTGTATT 
AACAGCAGGC 
ACCCGTTTCC 



GTGGCAGGGG ACCCAGCGAG CCCAGAGGGA TTTTGCCGCT GCTTCCTCTA 
TCACGCAGCT CTCTAAATTG ACTCAGCTCC AGGCTAGTGT GAGAAACACC 
CCATCTCAGA TCTTCACTAT GGCAACTTAT GCAAGAAACT GTTGAATTAG 
TATAGATGAG AAACCATACA AGCTGTGGTA TTT ATG AGC CTC CAT 

Met Ser Leu His 
1 



TTC TTA TAC TAC TGC AGT GAA CCA ACA TTG GAT GTG AAA ATT GCC TTT 
Phe Leu Tyr Tyr Cys Ser Glu Pro Thr Leu Asp Val Lys He Ala Phe 
5 ~ 10 15 20 

TGT CAG GGA TTC GAT AAA CAA GTG GAT GTG TCA TAT ATT GCC AAA CAT 
Cys Gin Gly Phe Asp Lys Gin Val Asp Val Ser Tyr He Ala Lys His 
25 30 35 

TAC AAC ATG AGC AAA AGC AAA GTT GAC AAC CAG TTC TAC AGT GTG GAA 
Tyr Asn Met Ser Lys Ser Lys Val Asp Asn Gin Phe Tyr Ser Val Glu 
40 45 50 

GTG GGA GAC TCA ACC TTC ACA GTT CTC AAG CGC TAC CAG AAT CTA AAG 
Val Gly Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gin Asn Leu Lys 
55 60 65 

CCT ATT GGC TCT GGG GCT CAG GGC ATA GTT TGT GCC GCG TAT GAT GCT 
Pro He Gly Ser Gly Ala Gin Gly He Val Cys Ala Ala Tyr Asp Ala 
70 * 75 80 

GTC CTT GAC AGA AAT GTG GCC ATT AAG AAG CTC AGC AGA CCC TTT CAG 
Val Leu Asp Arg Asn Val Ala He Lys Lys Leu Ser Arg Pro Phe Gin 
85 " 90 95 100 



60 
120 
180 
235 



283 



331 



379 



427 



475 



523 
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AAC CAA ACA CAT GCC AAG AGA GCG TAC CGG GAG CTG GTC CTC ATG AAG 571 
Asn Gin Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys 
105 HO 115 

TGT GTG AAC CAT AAA AAC ATT ATT AGT TTA TTA AAT GTC TTC ACA CCC 619 
Cys Val Asn His Lys Asn He He Ser Leu Leu Asn Val Phe Thr Pro 
120 125 130 

CAG AAA ACG CTG GAG GAG TTC CAA GAT GTT TAC TTA GTA ATG GAA CTG 667 
Gin Lys Thr Leu Glu Glu Phe Gin Asp Val Tyr Leu Val Met Glu Leu 
135 140 145 

ATG GAT GCC AAC TTA TGT CAA GTG ATT CAG ATG GAA TTA GAC CAT GAG 715 
Met Asp Ala Asn Leu Cys Gin Val He Gin Met Glu Leu Asp His Glu 
150 155 160 

CGA ATG TCT TAC CTG CTG TAC CAA ATG TTG TGT GGC ATT AAG CAC CTC 763 
Arg Met Ser Tyr Leu Leu Tyr Gin Met Leu Cys Gly He Lys His Leu 
165 170 175 180 

CAT TCT GCT GGA ATT ATT CAC AGG GAT TTA AAA CCA AGT AAC ATT GTA 811 
His Ser Ala Gly He He His Arg Asp Leu Lys Pro Ser Asn He Val 
185 190 195 

GTC AAG TCT GAT TGC ACA TTG AAA ATC CTG GAC TTT GGA CTG GCC AGG 859 
Val Lys Ser Asp Cys Thr Leu Lys He Leu Asp Phe Gly Leu Ala Arg 
200 205 210 

ACA GCA GGC ACA AGC TTC ATG ATG ACT CCA TAT GTG GTG ACA CGT TAT 907 
Thr Ala Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr 
215 220 225 

TAC AGA GCC CCT GAG GTC ATC CTG GGG ATG GGC TAC AAG GAG AAC GTG 955 
Tyr Arg Ala Pro Glu Val He Leu Gly Met Gly Tyr Lys Glu Asn Val 
230 235 240 

GAT ATA TGG TCT GTG GGA TGC ATT ATG GGA GAA ATG GTT CGC CAC AAA 1003 
Asp He Trp Ser Val Gly Cys He Met Gly Glu Met Val Arg His Lys 
245 250 255 260 

ATC CTC TTT CCA GGA AGG GAC TAT ATT GAC CAG TGG AAT AAG GTA ATT 1051 
He Leu Phe Pro Gly Arg Asp Tyr He Asp Gin Trp Asn Lys Val He 
265 270 275 

GAA CAA CTA GGA ACA CCA TGT CCA GAA TTC ATG AAG AAA TTG CAA CCC 1099 
Glu Gin Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gin Pro 
280 285 290 

ACA GTA AGA AAC TAT GTG GAG AAT CGG CCC AAG TAT GCG GGA CTC ACC 1147 
Thr Val Arg Asn Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Leu Thr 
295 300 305 

TTC CCC AAA CTC TTC CCA GAT TCC CTC TTC CCA GCG GAC TCC GAG CAC 1195 
Phe Pro Lys Leu Phe Pro Asp Ser Leu Phe Pro Ala Asp Ser Glu His 
310 315 320 

AAT AAA CTC AAA GCC AGC CAA GCC AGG GAC TTG TTG TCA AAG ATG CTA 1243 
Asn Lys Leu Lys Ala Ser Gin Ala Arg Asp Leu Leu Ser Lys Met Leu 
325 330 335 340 
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GTG ATT GAC CCA GCA AAA AGA ATA TCA GTG GAC GAC GCC TTA CAG CAT 1291 
Val lie Asp Pro Ala Lys Arg He Ser Val Asp Asp Ala Leu Gin His 
345 350 355 

CCC TAC ATC AAC GTC TGG TAT GAC CCA GCC GAA GTG GAG GCG CCT CCA 1339 
Pro Tyr He Asn Val Trp Tyr Asp Pro Ala Glu Val Glu Ala Pro Pro 
360 365 370 

CCT CAG ATA TAT GAC AAG CAG TTG GAT GAA AGA GAA CAC ACA ATT GAA 1387 
Pro Gin He Tyr Asp Lys Gin Leu Asp Glu Arg Glu His Thr He Glu 
375 380 385 

GAA TGG AAA GAA CTT ATC TAC AAG GAA GTA ATG AAT TCA GAA GAA AAG 1435 
Glu Trp Lys Glu Leu He Tyr Lys Glu Val Met Asn Ser Glu Glu Lys 
390 395 400 

ACT AAA AAT GGT GTA GTA AAA GGA CAG CCT TCT CCT TCA GCA CAG GTG 1483 
Thr Lys Asn Gly Val Val Lys Gly Gin Pro Ser Pro Ser Ala Gin Val 
405 410 415 420 

CAG CAG TGAACAGCAG TGAGAGTCTC CCTCCATCCT CGTCTGTCAA TGACATCTCC 1539 
Gin Gin 

TCCATGTCCA CCGACCAGAC CCTGGCATCT GACACTGACA GCAGCCTGGA AGCCTCGGCA 1599 

GGACCCCTGG GTTGTTGCAG GTGACTAGCC GCCTGCCTGC GAAACCCAGC GTTCTTCAGG 1659 

AGATGATGTG ATGGAACACA CACACACGCA GACACACACA CACACACAAA TGCAGACACA 1719 

CAACATCAAG AAAACAGCAA GGGAGAGAAT CCAAGCCTAA AATTAAATAA ATCTTTCAGC 1779 

CTGCTTCTTC CCCAGGGTTC TGTATTGCAG CTAAGCTCAA ATGTATATTT AACTTCTAGT 1839 

TGCTCTTGCT TTGGTCTTCT TCCAATGATG CTTACTACAG AAAGCAAATC AGACACAATT 1899 

AGAGAAGCCT TTTCCATAAA GTGTAATTTT AATGGCTGCA AAACCGGCAA CCTGTAACTG 1959 

CCCTTTTAAA TGGCATGACA AGGTGTGCAG TGGCCCCATC CAGCATGTGT GTGTCTCTAT 2019 

CTTGCATCTA CCTGCTCCTT GGCCTAGTCA GATGGATGTA GATACAGATC CGCATGTGTC 2079 

TGTATTCATA CAGCACTACT TACTTAGAGA TGCTACTCTC AGTGTCCTCA GGGCTCTACC 2139 

AAGACATAAT GCACTGGGGT ACCACATGGT CCATTTCATG TGATCTATTA CTCTGACATA 2199 

AACCCATCTG TAATATATTG CCAGTATATA AGCTGTTTAG TTTGTTAATT GATTAAACTG 2259 

TATGTCTTAT AAGAAAACAT GTAAAGGGGG AATATATTGG GGGAGTGAGC TCTCTCAGAC 2319 

CCTTGAAGAT GTAGCTTCCA AATTTGAATG GATTAAATGG CACCTGTATA CCA 2372 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



1 5 " 10 

aVB He Ala Phe Cys Gin Gly Phe Asp Lys 

20 25 
Cle Ala Lys His Tyr Asn Met Ser Lys Ser 

35 40 
Pyr Ser Val Glu Val Gly Asp Ser Thr Phe 

50 55 
31n Asn Leu Lys Pro He Gly Ser Gly Ala 
65 70 



NO: 8: 




Glu 


Pro 


Thr Leu Asp Val 






15 


Gin 


Val 


Asp Val Ser Tyr 






30 


Lys 


Val 


Asp Asn Gin Phe 




45 


Thr 


Val 


Leu Lys Arg Tyr 




60 




Gin 


Gly 


He Val Cys Ala 


75 


80 
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Ala Tyr Asp Ala Val Leu Asp Arg Asn Val Ala He Lye Lys Leu Ser 

85 90 95 

Arg Pro Phe Gin Asn Gin Thr His Ala Lys Arg Ala Tyr Arg Glu Leu 

100 105 HO 

Val Leu Met Lys Cys Val Asn His Lys Asn He He Ser Leu Leu Asn 

115 120 125 

Val Phe Thr Pro Gin Lys Thr Leu Glu Glu Phe Gin Asp Val Tyr Leu 

130 135 140 

Val Met Glu Leu Met Asp Ala Asn Leu Cys Gin Val He Gin Met Glu 
145 150 155 160 

Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gin Met Leu Cys Gly 

165 170 175 

He Lys His Leu His Ser Ala Gly He He His Arg Asp Leu Lys Pro 

180 185 190 

Ser Asn He Val Val Lys Ser Asp Cys Thr Leu Lys He Leu Asp Phe 

195 200 205 

Gly Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr Pro Tyr Val 

210 ~ 215 220 

Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val He Leu Gly Met Gly Tyr 
225 230 235 240 

Lys Glu Asn Val Asp He Trp Ser Val Gly Cys He Met Gly Glu Met 

245 250 255 

Val Arg His Lys He Leu Phe Pro Gly Arg Asp Tyr He Asp Gin Trp 

260 265 270 

Asn Lys Val He Glu Gin Leu Gly Thr Pro Cys Pro Glu Phe Met Lys 

275 280 285 

Lys Leu Gin Pro Thr Val Arg Asn Tyr Val Glu Asn Arg Pro Lys Tyr 

290 295 300 

Ala Glv Leu Thr Phe Pro Lys Leu Phe Pro Asp Ser Leu Phe Pro Ala 
305 310 315 320 

Asp Ser Glu His Asn Lys Leu Lys Ala Ser Gin Ala Arg Asp Leu Leu 

325 330 335 

Ser Lys Met Leu Val He Asp Pro Ala Lys Arg He Ser Val Asp Asp 

340 345 350 

Ala Leu Gin His Pro Tyr He Asn Val Trp Tyr Asp Pro Ala Glu Val 

355 360 365 

Glu Ala Pro Pro Pro Gin He Tyr Asp Lys Gin Leu Asp Glu Arg Glu 

370 375 380 

His Thr He Glu Glu Trp Lys Glu Leu He Tyr Lys Glu Val Met Asn 
385 390 395 400 

Ser Glu Glu Lys Thr Lys Asn Gly Val Val Lys Gly Gin Pro Ser Pro 

405 410 415 

Ser Ala Gin Val Gin Gin 
420 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1975 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 364. -.1641 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCCTCCTTAT TCCGGTTTGG AATGTGGCTA ATGAAAGCCC AGTAGGAGGA TTTCTGGGGC 60 
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AAACAGGTGG ACCAGGATCC TGGTTCTCAG GCACGGAATG GCTATTGTGA GAGCGCCACC 120 

AGCAGGACCA TCGCAGATCT TGGTTATGGC TGCTCACGCA AGAGGCTGTT GATGTAGACC 180 

CCCTTTCCCG TAGATGAGAA ATCACACGAG CAGTGGTATT TATGAGCCTC CATTTCTTAT 240 

ACTACTGCAG TGAACCAACC TTGGATGTGA AAATTGCCTT TTGTCAGGTG TGTGTTCCTT 300 

ACAGGTAAAA CAAAGGGATT CGACAAACAC GTGGATGTGT CTTCTGTTGT CAAACATTAC 360 

AAC ATG AGC AAA AGC AAG GTA GAT AAC CAG TTC TAC AGT GTG GAA GTG 408 
Met Ser Lys Ser Lys Val Asp Asn Gin Phe Tyr Ser Val Glu Val 
15 10 15 

GGA GAC TCA ACC TTC ACA GTT CTA AAG CGC TAC CAG AAC CTG AAG CCG 456 
Gly Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gin Asn Leu Lys Pro 
20 25 30 

ATC GGC TCT GGG GCT CAG GGA ATA GTT TGT GCT GCG TAT GAC GCT GTC 504 
He Gly Ser Gly Ala Gin Gly He Val Cys Ala Ala Tyr Asp Ala Val 
35 40 45 

CTC GAC AGA AAT GTG GCC ATT AAG AAG CTC AGC AGA CCC TTC CAG AAC 552 
Leu Asp Arg Asn Val Ala He Lys Lys Leu Ser Arg Pro Phe Gin Asn 
50 55 60 

CAA ACT CAT GCC AAG AGG GCT TAC CGG GAG CTG GTC CTC ATG AAG TGT 600 
Gin Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys 
65 70 75 

GTG AAC CAT AAA AAC ATT ATT AGC TTA TTA AAT GTC TTT ACA CCC CAG 648 
Val Asn His Lys Asn He He Ser Leu Leu Asn Val Phe Thr Pro Gin 
80 85 90 95 

AAA ACA CTG GAG GAG TTC CAA GAT GTT TAC TTA GTG ATG GAA CTG ATG 696 
Lys Thr Leu Glu Glu Phe Gin Asp Val Tyr Leu Val Met Glu Leu Met 
100 105 HO 

GAC GCC AAC TTG TGT CAG GTG ATT CAG ATG GAG CTG GAC CAC GAG CGG 744 
Asp Ala Asn Leu Cys Gin Val He Gin Met Glu Leu Asp His Glu Arg 
115 120 125 

ATG TCG TAC TTG CTG TAC CAG ATG CTG TCG GCG ATC AAA CAC CTC CAC 792 
Met Ser Tyr Leu Leu Tyr Gin Met Leu Ser Ala He Lys His Leu His 
130 135 140 

TCC GCT GGG ATC ATC CAC AGG GAC TTA AAA CCC AGT AAC ATC GTA GTC 84C 
Ser Ala Gly He He His Arg Asp Leu Lys Pro Ser Asn He Val Val 
145 150 155 

AAG TCT GAT TGC ACA CTG AAA ATC CTG GAC TTT GGA CTG GCC AGG ACA 88E 
Lys Ser Asp Cys Thr Leu Lys He Leu Asp Phe Gly Leu Ala Arg Thr 
160 " 165 170 175 

GCG GGC ACA AGC TTC ATG ATG ACT CCG TAT GTG GTG ACG AGA TAT TAC 93i 
Ala Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr 
180 185 190 

AGA GCC CCC GAG GTC ATC CTG GGC ATG GGC TAC AAG GAG AAC GTG GAC 98< 
Arg Ala Pro Glu Val He Leu Gly Met Gly Tyr Lys Glu Asn Val Asp 
195 200 205 

ATA TGG TCT GTG GGC TGC ATC ATG GGA GAA ATG GTT CGT CAC AAA ATC 103: 
He Trp Ser Val Gly Cys He Met Gly Glu Met Val Arg His Lys He 



210 



215 



220 
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• 



CTC TTT CCC GGA AGG GAC TAT ATT GAC CAG TGG AAC AAA GTC ATA GAG 1080 
Leu Phe Pro Gly Arg Asp Tyr He Asp Gin Trp Asn Lys Val He Glu 
225 230 235 

CAG CTA GGA ACT CCG TGT CCA GAA TTC ATG AAG AAA TTG CAG CCC ACC 1128 
Gin Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gin Pro Thr 
240 245 250 255 

GTC AGA AAC TAC GTG GAG AAC CGG CCC AAG TAT GCA GGC CTC ACC TTC 1176 
Val Arg Asn Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Leu Thr Phe 
260 265 270 

CCC AAG CTC TTT CCA GAT TCC CTC TTC CCA GCG GAT TCC GAG CAC AAT 1224 
Pro Lys Leu Phe Pro Asp Ser Leu Phe Pro Ala Asp Ser Glu His Asn 
275 280 285 

AAA CTT AAA GCC AGC CAA GCC AGG GAC TTG TTG TCA AAG ATG TTA GTG 1272 
Lys Leu Lys Ala Ser Gin Ala Arg Asp Leu Leu Ser Lys Met Leu Val 
290 295 300 

ATT GAC CCA GCG AAG AGG ATA TCG GTG GAT GAC GCA TTG CAG CAT CCG 1320 
He Asp Pro Ala Lys Arg He Ser Val Asp Asp Ala Leu Gin His Pro 
305 310 315 

TAC ATC AAC GTT TGG TAC GAC CCT GCT GAA GTG GAG GCG CCT CCG CCT 1368 
Tyr He Asn Val Trp Tyr Asp Pro Ala Glu Val Glu Ala Pro Pro Pro 
320 325 330 335 

CAG ATA TAT GAC AAG CAA TTG GAT GAA AGG GAG CAC ACC ATC GAA GAA 1416 
Gin He Tyr Asp Lys Gin Leu Asp Glu Arg Glu His Thr He Glu Glu 
340 345 350 

TGG AAA GAA CTC ATC TAC AAG GAA GTA ATG AAC TCA GAA GAG AAG ACT 1464 
Trp Lvs Glu Leu He Tyr Lys Glu Val Met Asn Ser Glu Glu Lys Thr 
355 360 365 

AAG AAC GGC GTA GTC AAA GGC CAG CCC TCA CCT TCA GGT GCA GCA GTG 1512 
Lys Asn Gly Val Val Lys Gly Gin Pro Ser Pro Ser Gly Ala Ala Val 
370 375 380 

AAC AGC AGT GAG AGT CTC CCT CCA TCC TCA TCT GTC AAC GAC ATC TCC 1560 
Asn Ser Ser Glu Ser Leu Pro Pro Ser Ser Ser Val Asn Asp He Ser 
385 390 395 

TCC ATG TCC ACC GAC CAG ACC CTC GCA TCC GAC ACT GAC AGC AGC CTG 1608 
Ser Met Ser Thr Asp Gin Thr Leu Ala Ser Asp Thr Asp Ser Ser Leu 
400 405 410 415 

GAA GCC TCG GCG GGA CCG CTG GGT TGT TGC AGG TGACTAGCCG CCTGCCTGCG 1661 
Glu Ala Ser Ala Gly Pro Leu Gly Cys Cys Arg 
420 425 

AAACCCAGCG TTCTTCAGGA GATGACGCCA TGATAGAACA CAGCGCACAT GCACACACAC 1721 

AGAGCTTGTA CACACACACA CACACACACA CACACGCACG CACGCACGCA CGCAAGCACG 1781 

CACGCACGCA CAAATGCACT CACGCAATGT CAAGAAAAAA AAAAGTAGCG AGAGAGAGCG 1841 

AGAGAGCCAA CGTAAAACTA AGTTAAATCT TTCTGCGTGC TTCTCCAGAG TTCTGTATCG 1901 

CAGCTGAGCT GAAATGTATA CTTAACTTCT AGTCGCGCTC GCTCGACTTT GGTCTCCCTC 1961 

CGGCAGTGCT TACT 1975 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Lys Ser Lys Val Asp Asn Gin Phe Tyr Ser Val Glu Val Gly 

15 10 15 

Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gin Asn Leu Lys Pro He 

20 25 30 

Gly Ser Gly Ala Gin Gly He Val Cys Ala Ala Tyr Asp Ala Val Leu 

35 40 45 

Asp Arg Asn Val Ala He Lys Lys Leu Ser Arg Pro Phe Gin Asn Gin 

50 55 60 

Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys Val 
65 70 75 80 

Asn His Lys Asn He He Ser Leu Leu Asn Val Phe Thr Pro Gin Lys 

85 90 95 

Thr Leu Glu Glu Phe Gin Asp Val Tyr Leu Val Met Glu Leu Met Asp 

100 105 HO 

Ala Asn Leu Cys Gin Val He Gin Met Glu Leu Asp His Glu Arg Met 

115 120 125 

Ser Tyr Leu Leu Tyr Gin Met Leu Ser Ala He Lys His Leu His Ser 

130 135 140 

Ala Gly He He His Arg Asp Leu Lys Pro Ser Asn He Val Val Lys 
145 150 155 160 

Ser Asp Cys Thr Leu Lys He Leu Asp Phe Gly Leu Ala Arg Thr Ala 

165 170 175 

Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg 

180 185 190 

Ala Pro Glu Val He Leu Gly Met Gly Tyr Lys Glu Asn Val Asp He 

195 200 205 

Trp Ser Val Gly Cys He Met Gly Glu Met Val Arg His Lys He Leu 

210 * * 215 220 

Phe Pro Gly Arg Asp Tyr He Asp Gin Trp Asn Lys Val He Glu Gin 
225 230 235 240 

Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gin Pro Thr Val 

245 250 255 

Arg Asn Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Leu Thr Phe Pro 

260 265 270 

Lys Leu Phe Pro Asp Ser Leu Phe Pro Ala Asp Ser Glu His Asn Lys 

275 280 285 

Leu Lys Ala Ser Gin Ala Arg Asp Leu Leu Ser Lys Met Leu Val He 

290 295 300 

Asp Pro Ala Lys Arg He Ser Val Asp Asp Ala Leu Gin His Pro Tyr 
305 310 315 320 

He Asn Val Trp Tyr Asp Pro Ala Glu Val Glu Ala Pro Pro Pro Gin 

325 330 335 

He Tyr Asp Lys Gin Leu Asp Glu Arg Glu His Thr He Glu Glu Trp 

340 345 350 

Lys Glu Leu He Tyr Lys Glu Val Met Asn Ser Glu Glu Lys Thr Lys 

355 360 365 

Asn Gly Val Val Lys Gly Gin Pro Ser Pro Ser Gly Ala Ala Val Asn 
370 375 380 



- 71 - 



Ser Ser Glu Ser Leu Pro Pro Ser Ser Ser Val Asn Asp lie Ser Ser 
385 390 395 400 

Met Ser Thr Asp Gin Thr Leu Ala Ser Asp Thr Asp Ser Ser Leu Glu 

405 410 415 

Ala Ser Ala Gly Pro Leu Gly Cys Cys Arg 
420 425 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 310... 1575 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGGGCTTGAG TGAGCTAAAG ATTGGGTCTT CTTGGAAATC ACCTGTCTGT TATTATTTTT 60 

AAACAATCGC TACACCTCCA AAGACTCTGC TCCTTACTCC GGTTTGGAAT GTGGCTAATG 120 

ACTACCCAGT AGGGAGGATT TCTGGGGCAA ACAGCCGGAC CAGGATCCTA GTTCTCAGGC 180 

ACGGAATGGC TATTGTGAGA ACAGCACCAG CAGGATCATC GCAGATCTTG GTTATGGCCA 240 

CTCAGGCAAG ACGCTGTTGA GTTAAGACCC CTTTCCCATA GATGAGAAGC CACAGAAGCA 300 

GTGGTATTT ATG AGC CTC CAT TTC TTA TAC TAC TGC AGT GAA CCA ACC TTG 351 
Met Ser Leu His Phe Leu Tyr Tyr Cys Ser Glu Pro Thr Leu 
1 5 10 

GAT GTG AAA ATT GCC TTT TGT CAG GGA TTC GAT AAA CAC GTG GAT GTG 399 
Asp Val Lys He Ala Phe Cys Gin Gly Phe Asp Lys His Val Asp Val 
15 20 25 30 

TCA TCT ATT GCC AAA CAT TAC AAC ATG AGC AAA AGC AAG GTG GAC AAC 447 
Ser Ser He Ala Lys His Tyr Asn Met Ser Lys Ser Lys Val Asp Asn 
35 40 45 

CAG TTC TAC AGT GTG GAA GTG GGG GAC TCA ACC TTC ACC GTT CTT AAG 495 
Gin Phe Tyr Ser Val Glu Val Gly Asp Ser Thr Phe Thr Val Leu Lys 
50 55 60 

CGC TAC CAG AAC CTG AAG CCA ATT GGC TCT GGG GCT CAG GGA ATA GTC 543 
Arg Tyr Gin Asn Leu Lys Pro He Gly Ser Gly Ala Gin Gly He Val 
65 70 75 

TGT GCT GCG TAC GAC GCT GTC CTT GAC AGA AAT GTG GCC ATT AAG AAG 591 
Cys Ala Ala Tyr Asp Ala Val Leu Asp Arg Asn Val Ala He Lys Lys 
80 85 90 

CTC AGC AGA CCC TTC CAG AAC CAA ACT CAC GCC AAG AGG GCT TAC CGG 639 
Leu Ser Arg Pro Phe Gin Asn Gin Thr His Ala Lys Arg Ala Tyr Arg 
95 100 105 HO 

GAG CTG GTG CTC ATG AAG TGT GTG AAC CAT AAA AAC ATT ATT AGC TTA 687 
Glu Leu Val Leu Met Lys Cys Val Asn His Lys Asn He He Ser Leu 
115 * 120 125 
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TTA AAT GTT TTT ACA CCC CAG AAA ACG CTG GAG GAG TTC CAA GAT GTC 735 
Leu Asn Val Phe Thr Pro Gin Lys Thr Leu Glu Glu Phe Gin Aap Val 
130 135 140 

TAC TTA GTG ATG GAA CTG ATG GAC GCC AAC CTG TGT CAG GTG ATT CAG 783 
Tyr Leu Val Met Glu Leu Met Asp Ala Asn Leu Cys Gin Val He Gin 
145 150 155 

ATG GAG CTG GAC CAC GAG CGG ATG TCT TAC TTG CTG TAC CAG ATG CTG 831 
Met Glu Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gin Met Leu 
160 165 170 

TGT GGC ATC AAG CAC CTC CAC TCC GCT GGG ATC ATC CAC AGG GAC TTA 879 
Cvs Gly He Lys His Leu His Ser Ala Gly He He His Arg Asp Leu 
175 180 185 190 

AAA CCC AGT AAC ATT GTA GTC AAG TCT GAT TGC ACA CTG AAA ATC CTC 927 
Lys Pro Ser Asn He Val Val Lys Ser Asp Cys Thr Leu Lys He Leu 
195 200 205 

GAC TTC GGA CTG GCC AGG ACA GCG GGT ACA AGC TTC ATG ATG ACT CCG 975 
Asp Phe Gly Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr Pro 
210 215 220 

TAT GTG GTG ACG CGA TAT TAC AGA GCC CCT GAG GTC ATC CTG GGC ATG 1023 
Tyr Val Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val He Leu Gly Met 
225 230 235 

GGC TAC AAG GAG AAC GTG GAC ATA TGG TCT GTG GGA TGC ATC ATG GGA 1071 
Gly Tyr Lys Glu Asn Val Asp He Trp Ser Val Gly Cys He Met Gly 
240 245 250 

GAA ATG GTT CGC CAC AAA ATC CTC TTT CCC GGA AGG AGC TAT ATT GAC 1119 
Glu Met Val Arg His Lys He Leu Phe Pro Gly Arg Ser Tyr He Asp 
255 260 265 270 

CAG TGG AAC AAA GTC ATC GAG CAG CTA GGA ACT CCG TGT CCA GAG TTC 1167 
Gin Trp Asn Lys Val He Glu Gin Leu Gly Thr Pro Cys Pro Glu Phe 
275 280 285 

ATG AAG AAA TTG CAG CCC ACA GTC AGA AAC TAC GTG GAG AAT CGG CCC 1215 
Met Lys Lys Leu Gin Pro Thr Val Arg Asn Tyr Val Glu Asn Arg Pro 
290 295 300 

AAG TAC GCA GGA CTC ACC TTC CCC AAG CTC TTT CCA GAT TCC CTC TTC 1263 
Lys Tyr Ala Gly Leu Thr Phe Pro Lys Leu Phe Pro Asp Ser Leu Phe 
305 310 315 

CCA GCG GAT TCT GAG CAC AAT AAA CTT AAA GCC AGC CAA GCC AGG GAT 1311 
Pro Ala Asp Ser Glu His Asn Lys Leu Lys Ala Ser Gin Ala Arg Asp 
320 325 330 

TTG TTG TCT AAG ATG TTA GTG ATT GAC CCA GTG AAG AGG ATA TCG GTG 1359 
Leu Leu Ser Lys Met Leu Val He Asp Pro Val Lys Arg He Ser Val 
335 * 340 345 350 

GAC GAC GCA CTG CAG CAT CCG TAC ATC AAC GTT TGG TAC GAC CCG GCT 1407 
Asp Asp Ala Leu Gin His Pro Tyr He Asn Val Trp Tyr Asp Pro Ala 
* 355 360 365 



- 73 - 



GAA GTG GAG GCG CCT CCG CCT CAG ATA TAT GAT AAG CAG CTG GAT GAA 1455 
Glu Val Glu Ala Pro Pro Pro Gin lie Tyr Asp Lys Gin Leu Asp Glu 
370 375 380 

AGG GAG CAC ACC ATC GAA GAA TGG AAA GAA CTT ATC TAC AAG GAG GTA 1503 
Arg Glu His Thr He Glu Glu Trp Lys Glu Leu He Tyr Lys Glu Val 
385 390 395 

ATG AAC TCA GAA GAG AAG ACT AAG AAT GGC GTA GTC AAA AGC CAG CCC 1551 
Met Asn Ser Glu Glu Lys Thr Lys Asn Gly Val Val Lys Ser Gin Pro 
400 405 410 

TCG CCT TCA GCA CAG GTG CAG CAG TGAACAGCAG TGAGAGTCTC CCTCCATCCT 1605 
Ser Pro Ser Ala Gin Val Gin Gin 
415 420 

CGGCTGTCAA CGACATCTCC TCCATGTCCA CCGACCAGAC CCTCGCATCT GACACTGACA 1665 

GCAGCCTGGA GGCCTCGGCG GGACCGTTGG GTTGTTGCAG GTGACTAGCC GCCTGCCTGC 1725 

GAAACCCAGC GTTCTTCAGG AGATGACGCG ATAGAACACA GCACACATGC ACACACACAG 1785 

CTTGCTCTCA CACACACTCA GCTTGCTCAC ACACACACAC ACACATACAC ACAAACACAC 1845 

ACTGTCTCTC TCTCACACAC ACACACTGTC ACAACGCACT CACGAAAGGT CAAGAAAAAA 1905 

ATAACAATAG AGAGATCCAA CATAAAATTA AGTTAAATTT TTCTGCGTGC TTCTCCAAAG 1965 

TTCTGTATCA CAGCTGAGCT GAAATGTATA CTTAACTTCT AGTTGCGCTC GCTTTGGTTT 2025 

CCCTCCAGCA GTGCTTACTA CACAAGACAA ATCAGACACA ATTAGAGAAA CCTTTCCCTA 2085 

AAGTGTAACT TAAGTGGCTG CAGAACCAGC AACCTGTAAC TGCCCTTCAA ATGGCATGAG 2145 

GAGGTGGGCA CGGGTCCCGC CCAGCATGTG TGTGTCTCTA TCTCGCGTCT ACCTGCTCTT 2205 

CCGGCCTAGT CAGATGGATG TAGATACAGA TCCCGCATGT GTCTGTATTC AAACAGCACT 2265 

TAGAGATGCT CCTGTCAGTG TCCTCCAGGC TCCACCAAGA CACACACCGG GGTACCACAT 2325 

GGTCCATTTC ATGTGATCTA TTACTCTGAC ATAAATCCAT CTGTAATATA TTGCCAGTAT 2385 

ATAAGCTGTT TAGTTTGTTA ATTGCTTAAG CTGTATGTCT TATAAGAGAC TATGTAAAGG 2445 

GGG AAAATGG AGGCGTGAAC TCTCAGACCC TTGAAGATGT AGCTTCCGAA TTTGACCGTT 2505 

AAATGGCACC GTATACC 2522 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Met 


Ser 


Leu 


His 


Phe 


Leu 


Tyr 


Tyr 


Cys 


Ser 


Glu 


Pro Thr Leu Asp Val 


1 








5 








10 




15 


Lys 


He 


Ala 


Phe 


Cys 


Gin 


Gly 


Phe 


Asp 


Lys 


His 


Val Asp Val Ser Ser 






20 










25 






30 


He 


Ala 


Lys 


His 


Tyr 


Asn 


Met 


Ser 


Lys 


Ser 


Lys 


Val Asp Asn Gin Phe 






35 








40 








45 


Tyr 


Ser 


Val 


Glu 


Val 


Gly 


Asp 


Ser 


Thr 


Phe 


Thr 


Val Leu Lys Arg Tyr 


50 










55 










60 


Gin 


Asn 


Leu 


Lys 


Pro 


He 


Gly 


Ser 


Gly 


Ala 


Gin 


Gly He Val Cys Ala 


65 








70 










75 


80 


Ala 


Tyr 


Asp 


Ala 


Val 


Leu 


Asp 


Arg 


Asn 


Val 


Ala 


He Lys Lys Leu Ser 






85 










90 




95 


Arg 


Pro 


Phe 


Gin 


Asn 


Gin 


Thr 


His 


Ala 


Lys 


Arg 


Ala Tyr Arg Glu Leu 






100 










105 






110 


Val 


Leu 


Met 


Lys 


Cys 


Val 


Asn 


His 


Lys 


Asn 


He 


He Ser Leu Leu Asn 






115 






120 








125 
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val 


Phe 


Thr 


Pro 


Gin 


Lys 


Thr 


Leu 


Glu 


Glu 


Phe Gin 


Asp Val Tyr Leu 




130 








135 








140 


Val 


Met 


Glu 


Leu 


Met 


Asp 


Ala 


Asn 


Leu 


Cys 


Gin Val 


He Gin Met Glu 


145 










150 








155 


160 


Leu 


Asp 


His 


Glu 


Arg 
165 


Met 


Ser 


Tyr 


Leu 


Leu 
170 


Tyr Gin 


Met Leu Cys Gly 
175 


He 


Lys 


His 


Leu 
180 


His 


Ser 


Ala 


Gly 


He 
185 


He 


His Arg 


Asp Leu Lys Pro 
190 


Ser 


Asn 


He 
195 


Val 


Val 


Lys 


Ser 


Asp 
200 


Cys 


Thr 


Leu Lys 


He Leu Asp Phe 
205 


Gly 


Leu 
210 


Ala 


Arg 


Thr 


Ala 


Gly 
215 


Thr 


Ser 


Phe 


Met Met 
220 


Thr Pro Tyr Val 


Val 


Thr 


Arg 


Tyr 


Tyr 


Arg 


Ala 


Pro 


Glu 


Val 


He Leu 


Gly Met Gly Tyr 


225 










230 










235 


240 


Lys 


Glu 


Asn 


Val 


Asp 
245 


He 


Trp 


Ser 


Val 


Gly 
250 


Cys He 


Met Gly Glu Met 
255 


val 


Arg 


His 


Lys 
260 


He 


Leu 


Phe 


Pro 


Gly 
265 


Arg 


Ser Tyr 


He Asp Gin Trp 
270 


Asn 


Lys 


Val 


He 


Glu 


Gin 


Leu 


Gly 


Thr 


Pro 


Cys Pro 


Glu Phe Met Lys 




275 










280 






285 


Lys 


Leu 
290 


Gin 


Pro 


Thr 


Val 


Arg 
295 


Asn 


Tyr 


Val 


Glu Asn 
300 


Arg Pro Lys Tyr 


Ala 


Gly 


Leu 


Thr 


Phe 


Pro 


Lys 


Leu 


Phe 


Pro 


Asp Ser 


Leu Phe Pro Ala 


305 








310 










315 


320 


Asp 


Ser 


Glu 


His 


Asn 
325 


Lys 


Leu 


Lys 


Ala 


Ser 
330 


Gin Ala 


Arg Asp Leu Leu 

335 


Ser 


Lys 


Met 


Leu 
340 


Val 


He 


Asp 


Pro 


Val 
345 


Lys 


Arg He 


Ser Val Asp Asp 
350 


Ala 


Leu 


Gin 


His 


Pro 


Tyr 


He 


Asn 


Val 


Trp 


Tyr Asp 


Pro Ala Glu Val 






355 








360 




365 


Glu 


Ala 
370 


Pro 


Pro 


Pro 


Gin 


He 
375 


Tyr 


Asp 


Lys 


Gin Leu 
380 


Asp Glu Arg Glu 


His 


Thr 


He 


Glu 


Glu 


Trp 


Lys 


Glu 


Leu 


He 


Tyr Lys 


Glu Val Met Asn 


385 










390 








395 


400 


Ser 


Glu 


Glu 


Lys 


Thr 
405 


Lys 


Asn 


Gly 


Val 


Val 
410 


Lys Ser 


Gin Pro Ser Pro 
415 


Ser 


Ala 


Gin 


Val 
420 


Gin 


Gin 















(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2347 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 615... 1616 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



AGCAACTTTC CTGACCCAGA GGACCGGTAA CAAGTGGCCG GGAGCAACTT TTGCAAATCT 60 

CTTCTGCGCC TTAAGGCTGC CACCGAGACT GTAAAGAAAA GGAGAAGAGG AACCTATACT 120 

CATACCAGTT CGCACAGGCC TAAGTTGGGC GAGGCCTAGC CGCGGCTGCC TAGCGTCCCC 180 

CCCCCCCTCA CAGCGGAGGA GGGGACAGTT GTTGGAGGCC GGGCGGCAGA CCCGATCGCG 240 

GGCCTCCACC GAGAATTCCG TGACGACTGG TCAGCACCGC CGGAGAGCCG CTGTTGCTGG 300 

GACTGGTCTG CGGGCTCCAA GGAACCGCTG CTCCCCGAGA GCGCTCCGTG AGTGACCGCG 360 
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ACTTTTCAAA GCTCGGCATC GCGCGGAGTC CTACCAACGT GAGTGCTAGC GGAGTCTTAA 420 

CCCTGCGCTC CCTGGAGCGA ACTGGGGAGG AGGGCTCAGG GGGAAGCACT GCCGTCTGGA 480 

GCGCACGCTC TAAACAAACT TTGTTACAGA AGCAGGGACG CGCGGGTATC CCCCCGCTTC 540 

CCGGCGCGCT GTTGCGGCCC CGAAACTTCT GCGCACAGCC CAGGCTAACC CCGCGTGAAG 600 

TGACGGACCG TTCT ATG ACT GCA AAG ATG GAA ACG ACC TTC TAC GAC GAT 650 
Met Thr Ala Lys Met Glu Thr Thr Phe Tyr Asp Asp 
15 10 

GCC CTC AAC GCC TCG TTC CTC CAG TCC GAG AGC GGT GCC TAC GGC TAC 698 
Ala Leu Asn Ala Ser Phe Leu Gin Ser Glu Ser Gly Ala Tyr Gly Tyr 
15 20 25 

AGT AAC CCT AAG ATC CTA AAA CAG AGC ATG ACC TTG AAC CTG GCC GAC 746 
Ser Asn Pro Lys lie Leu Lys Gin Ser Met Thr Leu Asn Leu Ala Asp 
30 " 35 40 

CCG GTG GGC AGT CTG AAG CCG CAC CTC CGC GCC AAG AAC TCG GAC CTT 794 
Pro Val Gly Ser Leu Lys Pro His Leu Arg Ala Lys Asn Ser Asp Leu 
45 50 55 60 

CTC ACG TCG CCC GAC GTC GGG CTG CTC AAG CTG GCG TCG CCG GAG CTG 842 
Leu Thr Ser Pro Asp Val Gly Leu Leu Lys Leu Ala Ser Pro Glu Leu 
65 70 75 

GAG CGC CTG ATC ATC CAG TCC AGC AAT GGG CAC ATC ACC ACT ACA CCG 890 
Glu Arg Leu He He Gin Ser Ser Asn Gly His He Thr Thr Thr Pro 
80 85 90 

ACC CCC ACC CAG TTC TTG TGC CCC AAG AAC GTG ACC GAC GAG CAG GAG 938 
Thr Pro Thr Gin Phe Leu Cys Pro Lys Asn Val Thr Asp Glu Gin Glu 
95 100 105 

GGC TTC GCC GAG GGC TTC GTG CGC GCC CTG GCT GAA CTG CAT AGC CAG 986 
Gly Phe Ala Glu Gly Phe Val Arg Ala Leu Ala Glu Leu His Ser Gin 
110 H5 120 

AAC ACG CTT CCC AGT GTC ACC TCC GCG GCA CAG CCG GTC AGC GGG GCG 1034 
Asn Thr Leu Pro Ser Val Thr Ser Ala Ala Gin Pro Val Ser Gly Ala 
125 130 135 140 

GGC ATG GTG GCT CCC GCG GTG GCC TCA GTA GCA GGC GCT GGC GGC GGT 1082 
Gly Met Val Ala Pro Ala Val Ala Ser Val Ala Gly Ala Gly Gly Gly 
145 150 155 

GGT GGC TAC AGC GCC AGC CTG CAC AGT GAG CCT CCG GTC TAC GCC AAC 1130 
Glv Gly Tyr Ser Ala Ser Leu His Ser Glu Pro Pro Val Tyr Ala Asn 
J 160 165 170 

CTC AGC AAC TTC AAC CCG GGT GCG CTG AGC AGC GGC GGT GGG GCG CCC 1178 
Leu Ser Asn Phe Asn Pro Gly Ala Leu Ser Ser Gly Gly Gly Ala Pro 
175 180 185 

TCC TAT GGC GCG GCC GGG CTG GCC TTT CCC TCG CAG CCG CAG CAG CAG 1226 
Ser Tyr Gly Ala Ala Gly Leu Ala Phe Pro Ser Gin Pro Gin Gin Gin 
190 195 200 

CAG CAG CCG CCT CAG CCG CCG CAC CAC TTG CCC CAA CAG ATC CCG GTG 1274 
Gin Gin Pro Pro Gin Pro Pro His His Leu Pro Gin Gin He Pro Val 
205 210 215 220 
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CAG CAC CCG CGG CTG CAA GCC CTG AAG GAA GAG CCG CAG ACC GTG CCG 1322 
Gin His Pro Arg Leu Gin Ala Leu Lys Glu Glu Pro Gin Thr Val Pro 
225 230 235 

GAG ATG CCG GGA GAG ACG CCG CCC CTG TCC CCT ATC GAC ATG GAG TCT 1370 
Glu Met Pro Gly Glu Thr Pro Pro Leu Ser Pro He Asp Met Glu Ser 
240 245 250 

CAG GAG CGG ATC AAG GCA GAG AGG AAG CGC ATG AGG AAC CGC ATT GCC 1418 
Gin Glu Arg He Lys Ala Glu Arg Lys Arg Met Arg Asn Arg He Ala 
255 260 265 

GCC TCC AAG TGC CGG AAA AGG AAG CTG GAG CGG ATC GCT CGG CTA GAG 1466 
Ala Ser Lys Cys Arg Lys Arg Lys Leu Glu Arg He Ala Arg Leu Glu 
270 275 280 

GAA AAA GTG AAA ACC TTG AAA GCG CAA AAC TCC GAG CTG GCA TCC ACG 1514 
Glu Lvs Val Lys Thr Leu Lys Ala Gin Asn Ser Glu Leu Ala Ser Thr 
285 290 295 300 

GCC AAC ATG CTC AGG GAA CAG GTG GCA CAG CTT AAG CAG AAA GTC ATG 1562 
Ala Asn Met Leu Arg Glu Gin Val Ala Gin Leu Lys Gin Lys Val Met 
305 310 315 

AAC CAC GTT AAC AGT GGG TGC CAA CTC ATG CTA ACG CAG CAG TTG CAA 1610 
Asn His Val Asn Ser Gly Cys Gin Leu Met Leu Thr Gin Gin Leu Gin 
320 325 330 

ACG TTT TGAGAACAGA CTGTCAGGGC TGAGGGGCAA TGGAAGAAAA AAAATAACAG 1666 
Thr Phe 



AGACAAACTT GAGAACTTGA CTGGTTGCGA CAGAGAAAAA AAAAGTGTCC GAGTACTGAA 1726 

GCCAAGGGTA CACAAGATGG ACTGGGTTCG GACTGACGGC GCCCCCAGTG TGCTCTGGAG 1786 

TGGGAAGGAC GTGGCGCGCC TGGCTTTGGC GTGGAGCCAG AGAGCAGGCC TATTGGCCGG 1846 

CAGACTTTGC GGAGCGCTGT GCCGCGCGCG ACCAGAACGA TGGACTTTTC GTTAACATTG 1906 

ACCAAGAACT GCATGGACCT AACATTCGAT CTCATTCAGT ATTAAAGGGG GGTGGGAGGG 1966 

GTTACAAACT GCAATAGAGA CTGTAGATTG CTTCTGTAGT GCTCCTTAAC ACAAAGCAGG 2026 

GAGGGCTGGG AAGGGGGGGA GGCTTGTAAG TGCCAGGCTA GACTGCAGAT GAACTCCCCT 2086 

GGCCTGCCTC TCTCAACTGT GTATGTACAT ATATATTTTT TTTTAATTTG ATGAAAGCTG 2146 

ATTACTGTCA ATAAACAGCT TCCTGCCTTT GTAAGTTATT CCATGTTTGT TTGTTTGGGT 2206 

GTCCTGCCCA GTGTTTGTAA ATAAGAGATT TGAAGCATTC TGAGTTTACC ATTTGTAATA 2266 

AAGTATATAA TTTTTTTATG TTTTGTTTCT GAAAATTTCC AGAAAGGATA TTTAAGAAAA 2326 

TACAATAAAC TATTGAAAAG T 2347 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Thr Ala Lys Met Glu Thr Thr Phe Tyr Asp Asp Ala Leu Asn Ala 

! 5 10 15 

Ser Phe Leu Gin Ser Glu Ser Gly Ala Tyr Gly Tyr Ser Asn Pro Lys 
20 25 30 
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He Leu Lys Gin Ser Met Thr Leu Asn Leu Ala Asp Pro Val Gly Ser 

35 40 45 

Leu Lys Pro His Leu Arg Ala Lys Asn Ser Asp Leu Leu Thr Ser Pro 

50 55 60 

Asp Val Gly Leu Leu Lys Leu Ala Ser Pro Glu Leu Glu Arg Leu He 
65 70 75 80 

He Gin Ser Ser Asn Gly His He Thr Thr Thr Pro Thr Pro Thr Gin 

85 90 95 

Phe Leu Cys Pro Lys Asn Val Thr Asp Glu Gin Glu Gly Phe Ala Glu 

100 105 HO 

Gly Phe Val Arg Ala Leu Ala Glu Leu His Ser Gin Asn Thr Leu Pro 

115 120 125 

Ser Val Thr Ser Ala Ala Gin Pro Val Ser Gly Ala Gly Met Val Ala 

130 135 140 

Pro Ala Val Ala Ser Val Ala Gly Ala Gly Gly Gly Gly Gly Tyr Ser 
145 150 155 160 

Ala Ser Leu His Ser Glu Pro Pro Val Tyr Ala Asn Leu Ser Asn Phe 

165 170 175 

Asn Pro Gly Ala Leu Ser Ser Gly Gly Gly Ala Pro Ser Tyr Gly Ala 

180 185 190 

Ala Gly Leu Ala Phe Pro Ser Gin Pro Gin Gin Gin Gin Gin Pro Pro 

195 200 205 

Gin Pro Pro His His Leu Pro Gin Gin He Pro Val Gin His Pro Arg 

210 215 220 

Leu Gin Ala Leu Lys Glu Glu Pro Gin Thr Val Pro Glu Met Pro Gly 
225 230 235 240 

Glu Thr Pro Pro Leu Ser Pro He Asp Met Glu Ser Gin Glu Arg He 

245 250 255 

Lys Ala Glu Arg Lys Arg Met Arg Asn Arg He Ala Ala Ser Lys Cys 

260 265 270 

Arg Lys Arg Lys Leu Glu Arg He Ala Arg Leu Glu Glu Lys Val Lys 

275 280 285 

Thr Leu Lys Ala Gin Asn Ser Glu Leu Ala Ser Thr Ala Asn Met Leu 

290 295 300 

Arq Glu Gin Val Ala Gin Leu Lys Gin Lys Val Met Asn His Val Asn 
305 310 315 320 

Ser Gly Cys Gin Leu Met Leu Thr Gin Gin Leu Gin Thr Phe 
325 330 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3967 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATACCAGAGA CTCAAAAAAA AAAAAAAAGT TCCAGATTGC TGGACAATGA CCCGGGTCTC 60 

ATCCCTTGAC CCTGGGAACC GGGTCCACAT TGAATCAGGT GCGAATGTTC GCTCGCCTTC 120 

TCTGCCTTTC CCGCCTCCCC TCCCCCGGCC GCGGCCCCGG TTCCCCCCCT GCGCTGCACC 180 

CTCAGAGTTG GCTGCAGCCG GCGAGCTGTT CCCGTCAATC CCTCCCTCCT TTACACAGGA 240 

TGTCCATATT AGGACATCTG CGTCAGCAGG TTTCCACGGC CGGTCCCTGT TGTTCTGGGG 300 

GGGGGACCAT CTCCGAAATC CTACACGCGG AAGGTCTAGG AGACCCCCTA AGATCCCAAA 360 

TGTGAACACT CATAGGTGAA AGATGTATGC CAAGACGGGG GTTGAAAGCC TGGGGCGTAG 420 

AGTTGACGAC AGAGCGCCCG CAGAGGGCCT TGGGGCGCGC TTCCCCCCCC TTCCAGTTCC 480 

GCCCAGTGAC GTAGGAAGTC CATCCATTCA CAGCGCTTCT ATAAAGGCGC CAGCTGAGGC 540 

GCCTACTACT CCAACCGCGA CTGCAGCGAG CAACTGAGAA GACTGGATAG AGCCGGCGGT 600 
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TCCGCGAACG 
CTACCCCTGG 
AACGCCGACT 
CTTTCCTACT 
CAGGTGAGTT 
ACTGGGGTGT 
CGCGGTCAGA 
CATAGTAAGA 
ATTCTTCTCT 
AGTCGCTAAC 
TGGAGCAGGG 
ATTGATAAAA 
TCAGGGATAT 
GCAGCAGTAG 
GCTTTTATCT 
CTCTCTCCCT 
CGCTCACCTC 
CTCTAGTGCC 
GCTGGTGCAG 
TTACGGACTC 
GTCAGGAGGC 
TTCTGGACCT 
AACAGAGAAG 
TCCCTCCCTT 
CAGCTTCCTA 
TTTAAATGCA 
GGTGCGGGTG 
ACCTTAACTG 
GAGAATCCGA 
GCTGACAGAT 
AGTTGGAGCT 
TATATCTCTC 
TGAGATTGCC 
ACCTGCCTGC 
GGATTTGACT 
GCCCCTTCTC 
CGTGGAGCTG 
TGGCTCAGAG 
AGACTGGGAG 
GCCCCTGTGT 
TGTCTTCACC 
CAGCAGCAGC 
AGCAGTCAGA 
TACAGAGAGG 
CTGTTCGTGA 
TCCAGTCCTC 
CATGGAGTGT 
AAGGCCTGGT 
TGTTGGGTTC 
ACAATACCTA 
TAAGAAAAGA 
GTAGTCCTTC 
TGAATGTTCT 
TATTAAGATG 
TTCGACATGT 
AGGACATTGT 
TCAGTTT 



AGCAGTGACC 
ACCCCTTGCC 
ACGAGGCGTC 
ACCATTCCCC 
TGGCTTTGTG 
GACTCGCAGG 
GCAGCCTTAG 
CTTAGTGACC 
CGGCCGACTT 
TAGAGTTTGG 
AGGAGGGAGG 
AGCGAGTTCA 
TTATAACAAA 
GATGGAGGAG 
CCGATGAGGT 
GCTTCATGCG 
CGCTTTCCTC 
AACTTTATCC 
CCCACTCTGG 
CCCACCCAGT 
AGAGCGCAGA 
TTGTGGGCTG 
GGACGCTACT 
GTCAGATTTT 
CTTCATTCTC 
AATCACACTC 
CTGTAGGAGC 
AATGTTGGTC 
AGGGAACGGA 
ACACTCCAAG 
TGGGACTATG 
CCTATGCAGG 
AATCTGCTGA 
AAGATCCCCG 
GGAGGTCTGC 
AACGACCCTG 
AAGGCAGAAC 
ACCTCCCGCT 
CCTCTGCACA 
ACTCCCGTGG 
TACCCTGAAG 
AACGAGCCCT 
GAAGGCAAGG 
AGAAACACGT 
AACACACCAG 
ACCTCTTCCA 
GTTGTTCCTA 
CTGTGTCTCT 
ATTATTGGAA 
CTGTGTTCCT 
TAGGACTTTA 
AGCGTCAATG 
GACATTAACA 
GATTCTCAGA 
GGAAAGTGAA 
CAATAAAAGC 



GCGCTCCCAC 
GGGCTTTCCC 
ATCCTCCCGC 
AGCCGACTCC 
TAGCCGCCAG 
GGCGGGGGGG 
CCTGGGAACC 
CCTTCCCGCG 
GTTTCTGAGA 
GAGGCGGCAA 
AGGGAGGAGG 
TTCTGGAGAC 
CCCCCTTTCG 
AAAGGCTGCG 
GCATACAGGA 
ACACTAGGGC 
TTTTTGTTTT 
CCACGGTGAC 
TCTCCTCCGT 
CTGCTGGGGC 
GCATCGGCAG 
GGGGGGGGGG 
GACTGCACTT 
GACAGTTGGA 
TAGCCGGGGA 
TGCCTGCCAA 
CAGTTTCACT 
CTTTTCTTCT 
ATAAGATGGC 
CGGTAGGTTG 
GGCGCAGGGT 
AGACAGATCA 
AAGAGAAGGA 
ATGACCTTGG 
CTGAGGCTTC 
AGCCCAAGCC 
CCTTTGATGA 
CTGTGCCAGA 
GCAATTCCTT 
TCACCTGTAC 
CTGACTCCTT 
CCTCCGACTC 
CAGCCGGCAT 
CTTCCCTCGA 
GCTGTGGGCC 
GAGATGTAGC 
GTGACACCTG 
TTTCTCTTTC 
TTAACCTGGT 
GGCAATAGCG 
TTTTCCAGTA 
TTCATTGTCA 
GTTTTCCATG 
TATTTATATT 
TTTGAATGAA 
ATTTAAGTTG 



CCAGCTCTGC 
CAAACTTCGA 
TGCAGTAGCG 
TTCTCCAGCA 
GTCCGCGCTG 
TCTTCCTTTT 
CAGGACTTGT 
CGGCAGGTTT 
TCAGCCGGGG 
ACCGCGGCAA 
GTGCTGCGGG 
TCCGGAGCAG 
AGCGAGTGAT 
CTGCGGAATT 
AGACATAAGC 
CACTTGCTCC 
GTTTCAGGAC 
AGCCATCTCC 
GGCCCCATCG 
TTACGCCAGA 
AAGGGGCAAA 
GGGGGGCGGA 
CCTGACCAGG 
CCCAAGACAA 
GCTTCTTTGT 
CTGCAGGTTA 
GGGGTGACTG 
ATAGCTATCT 
TGCAGCCAAG 
AACCAGCTGC 
CCTTGAGCAT 
ACTTGAAGAT 
AAAACTGGAG 
CTTCCCAGAG 
CACCCCAGAG 
ATCCTTGGAG 
CTTCTTGTTT 
TGTGGACCTG 
GGGGATGGGG 
TCCGGGCTGC 
CCCAAGCTGT 
CCTGAGCTCA 
CCAGACGTGC 
AGGTTCCCGT 
TCAAGGACTT 
AAAAACAAAA 
AGAGCTGGTA 
TCCTTAGTTT 
GCTGGATTGT 
TGTTCCAATT 
GATAGAAATC 
TGTTACTGAT 
AAAACGTTTT 
TTTATTTTAT 
AAATTTTAAG 
AATGCGACCA 



TCTGCAGCTC 
CCATGATGTT 
CCTCCCCGGC 
TGGGCTCTCC 
AGGGTCGCCG 
TCGCTCTGGA 
CTGAGCGCGT 
ATTCTGAGTG 
CCAACAAGTC 
TCCCCCCTCC 
CGGGTGTGTA 
CGCCTGCGTC 
GCCGAAGGGA 
CAAGGGAGGA 
AGTCTCTGAC 
ACCTGTGTCT 
TTTTGCGCAG 
ACCAGCCCAG 
CAGACCAGAG 
GCGGGAATGG 
GTAGAGCAGG 
GACTGACGCA 
AGCTGTGGCT 
ACTCTAGACA 
TCCCCTGCTA 
GAAAAACTGC 
AATGGAGGTG 
CCTGAAGAGG 
TGCCGGAATC 
TGCTCCTGAA 
GCCCGTGTCT 
GAGAAGTCTG 
TTTATTTTGG 
GAGATGTCTG 
TCTGAGGAGG 
CCAGTCAAGA 
CCGGCATCAT 
TCCGGTTCCT 
CCCATGGTCA 
ACTACTTACA 
GCCGCTGCCC 
CCCACGCTGC 
CACTGCCCGA 
CGACCTAGGG 
GCAAGCATCC 
CAAAACAAAA 
GTTAGTAGAG 
TCTCATAGCA 
ATCTAGTGCA 
AGAAACGACC 
AATAGCTATA 
CATGCATTGT 
TATTGTGTTT 
TTTTTTCTAC 
CATTGTTTGC 
CCTTCTTGCT 



CCACCAGTGT 
CTCGGGTTTC 
CGGGGACAGC 
TGTCAACACA 
TGGAGGAGAC 
GGGAGACTGG 
GCACACTTGT 
GCCTGCCTGC 
TCGAGCAAAG 
CGGGGCAGCC 
AGGCAGTTTC 
AGCGCAGACG 
TAACGGGAAC 
TATTGGGAGA 
CGGAATGCTT 
GGAACCTCCT 
ATCTGTCCGT 
ACCTGCAGTG 
CGCCCCATCC 
TGAAGACCGT 
TGAGCAGCGA 
CAGACCACAC 
GCTAGCCCTT 
GTTTCCCTGA 
AAGATCTCAC 
TTCACCGAGA 
ACACTAGACA 
AAGAGAAACG 
GGAGGAGGGA 
ACTTTATTAA 
TATGCTTTCT 
CGTTGCAGAC 
CAGCCCACCG 
TGGCCTCCCT 
CCTTCACCCT 
GCATCAGCAA 
CTAGGCCCAG 
TCTATGCAGC 
CAGAGCTGGA 
CGTCTTCCTT 
ACCGAAAGGG 
TGGCCCTGTG 
GCTGGTGCAT 
AGGACCTTAC 
ACATCTGGCC 
CAAAAAACCG 
CATGTGAGTC 
CTAACTAATC 
GCTGATTTTA 
AATATTAAAC 
TCCATGTACT 
CGAGGTGGTC 
TCAATTTATT 
CCTGAGGTCT 
TTATTGTTCC 
CTCTTTATTC 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
3967 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Met Phe Ser Gly Phe Asn Ala Asp Tyr Glu Ala Ser Ser Ser Arg 

15 10 15 

Cvs Ser Ser Ala Ser Pro Ala Gly Asp Ser Leu Ser Tyr Tyr His Ser 

* 20 25 30 

Pro Ala Asp Ser Phe Ser Ser Met Gly Ser Pro Val Asn Thr Gin Asp 

35 40 45 

Phe Cys Ala Asp Leu Ser Val Ser Ser Ala Asn Phe He Pro Thr Val 

50 55 60 

Thr Ala He Ser Thr Ser Pro Asp Leu Gin Trp Leu Val Gin Pro Thr 
65 70 75 80 

Leu Val Ser Ser Val Ala Pro Ser Gin Thr Arg Ala Pro His Pro Tyr 

85 90 95 

Gly Leu Pro Thr Gin Ser Ala Gly Ala Tyr Ala Arg Ala Gly Met Val 

100 105 HO 

Lvs Thr Val Ser Gly Gly Arg Ala Gin Ser He Gly Arg Arg Gly Lys 

115 * 120 125 

Val Glu Gin Leu Ser Pro Glu Glu Glu Glu Lys Arg Arg He Arg Arg 

130 135 140 

Glu Arg Asn Lys Met Ala Ala Ala Lys Cys Arg Asn Arg Arg Arg Glu 
145 150 155 160 

Leu Thr Asp Thr Leu Gin Ala Glu Thr Asp Gin Leu Glu Asp Glu Lys 

165 170 175 

Ser Ala Leu Gin Thr Glu He Ala Asn Leu Leu Lys Glu Lys Glu Lys 

180 185 190 

Leu Glu Phe He Leu Ala Ala His Arg Pro Ala Cys Lys He Pro Asp 

195 200 205 

Asp Leu Gly Phe Pro Glu Glu Met Ser Val Ala Ser Leu Asp Leu Thr 

210 215 220 

Glv Glv Leu Pro Glu Ala Ser Thr Pro Glu Ser Glu Glu Ala Phe Thr 
225 230 235 240 

Leu Pro Leu Leu Asn Asp Pro Glu Pro Lys Pro Ser Leu Glu Pro Val 

245 250 255 

Lvs Ser He Ser Asn Val Glu Leu Lys Ala Glu Pro Phe Asp Asp Phe 

1 260 265 270 

Leu Phe Pro Ala Ser Ser Arg Pro Ser Gly Ser Glu Thr Ser Arg Ser 

275 280 285 

Val Pro Asp Val Asp Leu Ser Gly Ser Phe Tyr Ala Ala Asp Trp Glu 

290 * 295 300 

Pro Leu His Ser Asn Ser Leu Gly Met Gly Pro Met Val Thr Glu Leu 
305 310 315 320 

Glu Pro Leu Cys Thr Pro Val Val Thr Cys Thr Pro Gly Cys Thr Thr 

325 330 335 

Tvr Thr Ser Ser Phe Val Phe Thr Tyr Pro Glu Ala Asp Ser Phe Pro 

* 340 345 350 

Ser Cys Ala Ala Ala His Arg Lys Gly Ser Ser Ser Asn Glu Pro Ser 

355 360 365 

Ser Asp Ser Leu Ser Ser Pro Thr Leu Leu Ala Leu 
370 375 380 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AAGAAATGGA GGCTCATAAA TACCACAGCT 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ATTGGAAGAA GACCAAAGCA AGAGCAACTA 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TAAGTAAGTA GTGCTGTATG AATACAGACA 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
TACTGGCAAT ATATTACAGA TGGGTTTATG 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 
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• 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



GTGTGCAGCT TATGATGCTA TTCTTGAA 



28 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CGCGTCACCA CATACGGAGT CATC 24 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
CTGGAGGAGT TCCAAGATGT CTACT 25 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TGGAAAGAGC TTGGGGAAGG TGAG 24 
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